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Abstract 

We consider the complexity of counting homomorphisms from an r-uniform hypergraph G 
to a symmetric r-ary relation H. We give a dichotomy theorem for r > 2, showing for which H 
this problem is in FP and for which H it is #P-complctc. This generalises a theorem of Dyer 
and Greenhill (2000) for the case r = 2, which corresponds to counting graph homomorphisms. 
Our dichotomy theorem extends to the case in which the relation H is weighted, and the 
goal is to compute the partition function, which is the sum of weights of the homomorphisms. 
This problem is motivated by statistical physics, where it arises as computing the partition 
function for particle models in which certain combinations of r sites interact symmetrically. In 
the weighted case, our dichotomy theorem generalises a result of Bulatov and Grohe (2005) for 
graphs, where r = 2. When r = 2, the polynomial time cases of the dichotomy correspond simply 
to rank-1 weights. Surprisingly, for all r > 2 the polynomial time cases of the dichotomy have 
rather more structure. It turns out that the weights must be superimposed on a combinatorial 
structure defined by solutions of an equation over an Abelian group. Our result also gives a 
dichotomy for a closely related constraint satisfaction problem. 



1 Introduction 

We consider the complexity of counting homomorphisms from an r-uniform hypergraph G to a 
symmetric r-ary relation H. We will give a dichotomy theorem for r > 2, showing that counting is 
in polynomial time for certain H and is #P-complete for the remainder. Moreover our dichotomy 
is effective, meaning that there is an algorithm that takes H as input and determines whether 
the counting problem is polynomial time solvable or whether it is #P-complete. This generalises 
a theorem of Dyer and Greenhill [10] for the case r = 2, which corresponds to counting graph 
homomorphisms or //-colourings. 

'Partly funded by the EPSRC grant "The complexity of counting in constraint satisfaction problems". Some of 
the work was done while the authors were visiting the "Combinatorics and Statistical Mechanics" programme of the 
Isaac Newton Institute for Mathematical Sciences, University of Cambridge. 
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Our dichotomy extends to the case in which the relation H is weighted, and we wish to compute 
the partition function, which is the sum of weights of all homomorphisms. Here our dichotomy 
theorem extends a result of Bulatov and Grohe [3j for the case of graphs, r = 2. In the graph 
dichotomy, the polynomial time cases correspond simply to weights which form rank-1 matrices. 
Surprisingly, for all r > 2, the polynomial time solvable cases are more structured. It turns out 
that the weights must be superimposed on a combinatorial structure defined by solutions of an 
equation over an Abelian group. We note that this already appears in a disguised form in the case 
r = 2. The bipartite case, which has no obvious analogue for r > 2, corresponds to the equation 
ct\ + ct2 = 1 over the group Z2. 

A motivation for considering this question comes from statistical physics. Identifying V{G) with 
a set of sites and D with a set of q spins, the quantity that we wish to compute, Z 9 (G), can be 
viewed as the partition function of a statistical physics model in which certain sets of r sites interact 
symmetrically, and their interaction contributes to the Hamiltonian of the system. The partition 
function then gives the normalising constant for the Gibbs distribution of the system. The sets of 
r interacting sites are the edges of G. (Sometimes, an edge of size greater than 2 is referred to as 
a "hyperedge", but we do not use that terminology here.) Clearly, the sites in an edge should be 
distinct, although their spins need not be. In this application, the edges would usually represent 
sets of sites which are in close physical proximity. 

1.1 Notation and definitions 

An r -uniform hypergraph G was defined by Berge pQ to be a system of subsets of a set V(G), where 
n = \V{G)\, in which each subset has cardinality r. The elements of V{G) are the vertices of the 
hypergraph, and the subsets are its edges. Then E(G) denotes the edge set of G. Let M = \E(G)\. 
Note that the edges of G are distinct sets, otherwise the set system is a multihypergraph. Note also 
that the edges are sets, not multisets, otherwise the multiset system has been called a hypergraph 
with multiplicities [12]. Note that "r-uniform hypergraph with multiplicities" is synonymous with 
"symmetric r-ary relation". A loop is then a (multiset) edge in which all r vertices are the same [12J. 
Therefore a simple graph G = (V, E) (having no loops or parallel edges) is a 2-uniform hypergraph, 
a graph with parallel edges is a 2-uniform multihypergraph, and a graph with loops is a 2-uniform 
hypergraph with multiplicities, or a symmetric binary relation. 

Let D be a finite set with q = \D\. We will assume q > 2, since the cases q < 1 are trivial. For 
some r > 3, we consider a symmetric r-ary function g with domain D and codomain a set of real 
numbers. The codomain we will choose is the set of nonnegative algebraic numbers, Q~ . Thus Q 
denotes the field of all algebraic numbers, and we let Q denote the positive numbers in Q. Our 
principal reason for this choice is that arithmetic operations and comparisons on such numbers can 
be carried out exactly on a Turing machine. See, for example, [6]. Moreover, since our analysis 
is entirely concerned with polynomial equations, it is natural to work in Q, which is the algebraic 
closure of the rational field Q. 

Given a symmetric function g : D r — > Q~° and an r-uniform hypergraph G as input, the partition 
function associated with g is 

Z9 ( G )= E II ff(<rW,..,<r(«r)). (1) 

a:V(G)^D ( U1 ,...,u r )£E(G) 

Eva I (g) is the problem of computing Z 9 {G), given the input G. Each choice for the function g leads 
to a computational problem which we will call Eva I (<?), and we may ask how the computational 
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complexity of Eval(g) varies with g. 

We may view ([1]) as the evaluation of a multivariate polynomial function of the weights g(x) 
{x £ D r ). If there are N different irrational weights ^1,^2, • • -£jv, we can perform the necessary 
computations in the field Q(£i,£2j • • -£iv)- It is known that this field is equivalent to Q(0) for a 
single algebraic number 9, the primitive element, and an algorithm to determine 9 exists. We do not 
need to consider the efficiency of this algorithm, since N is a constant. The standard representation 
of a number in Q(0) is a constant degree polynomial in 9 with rational coefficients. Arithmetic 
operations in Q(0) can be carried out in this representation. For details, see [6]. We assume 
that g is pre-processed so that all weights are given in this standard representation. Some of our 
intermediate reductions seemingly require computing in larger algebraic number fields. This is true 
even if all original weights are rational, and justifies our choice of Q as the codomain of g. We 
will suppose, without further comment, that the necessary algebraic numbers are adjoined to Q(#) 
as required. In any case, we compute only in numbers fields which have constant degree over Q. 
Despite this increase in field size during our reductions, we will show that the resulting algorithm 
for the polynomial time solvable cases can perform its computations entirely within Q(0). Note 
that the exact representation in Q(9) can also be used to compute in FP any polynomial number 
of bits of the binary expansion of Z g (G), if this is required. 

It is easy to bound the number of different monomials which occur in ([1]). Suppose there are K 
nonzero weights, for some < K < (-H . Then the polynomial ([T]) has at most 

monomial terms, which is polynomial in the size of the input. Each monomial can be computed 
exactly in FP, working in the field Q(#). The coefficient of each monomial is an integer, which is 
easily seen to be computable in #P. The nondeterministic Turing machine guesses a : V(G) — > D, 
computes the term in ([T]) as a monomial in the weights and accepts if it is the chosen monomial. 
Therefore Z 9 (G) can be computed exactly in FP* P as an element of Q(0). Consequently, showing 
that Z 9 (G) is #P-hard implies that it is complete for FP* P . We make use of this observation 
below. 

It will be helpful to describe a constraint satisfaction problem which is closely related to Eva I (g). 
An instance I of #CSP(g) consists of a set V(I) = {vi, . . . , v n } of variables and a multiset E(I) of 
constraints. Each constraint has a scope, (u±, . . . , u r ), which is a tuple of r variables. The partition 
function Z 9 (I) is given by 

29 W= E II »(o-(« 1 ),...,<r(«r)). (2) 

u:V{I)^D ( Ul ,...,u r )£E(I) 

Thus, every instance G of Eval (5) can be viewed as an instance of #CSP(g) by taking the vertices 
as variables and the edges as constraint scopes. The value of the partition function that gets 
output is the same in both cases. Thus, we have a trivial polynomial time reduction from Eval(g) 
to #CSP(g). The opposite is not necessarily true, because a constraint scope (ui, . . . ,u r ) of an 
instance I of #CSP(g) might not be an edge - the same variable might appear more than once 
amongst m, . . . ,u r . Also, the same scope might appear more than once in E(I). So an instance 
/ of #CSP(g) might not be a properly-formed instance of Eval(g). In fact, / is a multihypergraph 
with multiplicities in general, rather than a hypergraph. Nevertheless, our main result applies also 
to the problem #CSP(g) — see Corollary [3J We note that both the Eval(<7) and the #CSP(g) 
problems have been studied extensively. 
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The problem #CSP(g) may be generalised to the case in which the parameter g is replaced by a 
set of functions V. If F is a set of functions (of various arities) from D to Q~° , then #CSP(T) is 
the problem of computing the partition function of an instance I in which each constraint with 
r-ary scope specifies a particular r-ary function from T which should be applied to the scope in the 
partition function. See [1] or [9] for further details. If the functions in T are not required to have 
any additional properties, like symmetry or given arity, #CSP(r) is actually no more general than 
#CSP(g), at least from the viewpoint of computational complexity. It can be shown that the two 
problems have the same complexity under polynomial time reductions [5J. Note, however, that the 
reduction from #CSP(T) to #CSP(<?) given in [5] does not preserve symmetry. So this equivalence 
does not permit us to replace a family T of symmetric functions by a single symmetric function g. 
This holds even in the simplest possible case in which T has two unary functions. Hence, restricted 
to symmetric functions, #CSP(T) may be a more general problem than #CSP(g), but we do not 
consider it further here. 

1.2 Previous work 

The computational complexity of problems of the type we consider here was first investigated by 
Dyer and Greenhill |10j . who examined the complexity of Eva I (g) in the special case in which r = 2 
and g : D 2 — > {0, 1}, so g is equivalent to a symmetric relation on D. This is the problem of 
counting homomorphisms from an input simple graph G to a fixed (undirected) graph H, possibly 
with loops, where the function g represents the adjacency matrix of H. They showed that there is 
a polynomial time algorithm when each connected component of H is either a complete unlooped 
bipartite graph or a complete looped graph. In all other cases the counting problem Eva I {g) is 
#P-complete. 

More generally, Bulatov and Grohe [3] considered the complexity of #CSP(g) when g is a symmetric 
binary function on D. If the input is a simple graph G, we can think of this as counting weighted 
homomorphisms from G to an undirected graph H with nonnegative edge weights. The function 
g is equivalent to the weighted adjacency matrix A of H. If H is connected, then we say that the 
matrix A is "connected" , otherwise the "connected components" of A correspond to the connected 
components of the graph H. Similarly, we say that A is bipartite if and only if H is bipartite. In 
this setting, Bulatov and Grohe [3] established the following important theorem, which is central 
to our analysis. 

Theorem 1 (Bulatov and Grohe). Let A be a symmetric matrix with non-negative real entries. 

(1) If A is connected and not bipartite, then Eval(A) is in polynomial time if the row rank of A 
is at most 1; otherwise Eval(Aj is #P-hard. 

(2) If A is connected and bipartite, then Eval(A) is in polynomial time if the row rank of A is at 
most 2; otherwise Eval(A) is #P-hard. 

(3) If A is not connected, then Eval(A) is in polynomial time if each of its connected components 
satisfies the corresponding condition stated in (1) or (2); otherwise Eva I (A) is j^-P-hard. 

Although Theorem [1] is stated for real numbers, we will make use of it only in the case of the 
algebraic numbers, since it is not clear to us how it extends to the models of real computation 
discussed in [4J. We prefer to work entirely in the standard Turing machine model of computation, 
though there may well be models of real computation in which Theorem Q] is valid. For algebraic 
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numbers, which include the rationals, all the arithmetic operations and comparisons required in 
our reductions, and those of [3], can be carried out exactly in the Turing machine model. 

In the unweighted case of #CSP(T), where all functions in T have codomain {0, 1}, Bulatov [2] has 
recently shown that there is a dichotomy between those T for which #CSP(T) is polynomial time 
solvable, and those for which it is #P-complete. The dichotomy can be extended to the case in 
which all functions in T have codomain Q- , the nonnegative rational numbers, using polynomial 
time reductions [5J. However, the reductions involved do not seem to extend to functions with 
codomain Q~°. 

Establishing the existence of a dichotomy for ^CSP(r) is a major breakthrough. Nevertheless, 
the techniques of [2] shed very little light on which T render #CSP(T) polynomial time solvable, 
and which T render it #P-hard. In the current state of knowledge, Bulatov's dichotomy [2] is not 
effective, and its decidability is an open question. 

1.3 The new results 

Our main theorem, TheoremEl gives a dichotomy for the case in which T contains a single symmetric 
function g. For this problem, we identify a set of functions g for which Eva I (g) is computable in 
FP, and we show that, for every other function g, Eval(g) is complete for FP* P . 

We examine both Eval(g) and #CSP(g) in this setting, and give an explicit dichotomy theorem 
in both cases, extending the theorems of Dyer and Greenhill [TU] and Bulatov and Grohe [3j to 
r > 2. In the r > 2 case, the problem Eval(g) can be understood as evaluating sums of weighted 
homomorphisms from an input hypergraph G to a fixed weighted hypergraph with multiplicities H. 
The weights of edges in H are represented by the function g. 

As in the r = 2 case, there is a dichotomy, but this time some nontrivial algebraic structure is 
involved in the classification. The polynomial time solvable cases have rank-1 weights as before, 
but this time, these weights are superimposed on a combinatorial structure defined by solutions to 
an equation over an Abelian group. In particular, Eva I (g) is polynomial time solvable if and only 
if each connected piece of the domain factors as the cartesian product of two sets A and [s]. Then, 
for any a±, . . . ,a r 6 A and i%, . . . , i r € [s], the value of g((ai,ii), . . . , (a r , i r )) is equal to unless 
(ax, . . . , Q.r) is a solution to an equation in an Abelian group with domain A. In that case, the value 
g((ai,ii), . . . , (a r ,i r )) is just the product of some positive weights , . . . , Aj r . A "connected piece" 
of the domain is defined as follows: two elements z and z' are linked if there are some Z2, ■ ■ ■ , z r _\ 
such that g(z, Z2, z r -i, z') > 0. In general, two elements z and z' are connected if there is a 
sequence of c elements Zi, . . . , z c with z = z\ and z' = z c such that each pair (zi, -Zj+i) is linked. 
See Theorem [2] for details. 

In fact, it turns out that there is only one way to factor the connected component of the domain 
into A and [s] (see Theorem SJ. Thus, there is a straightforward algorithm that takes g and 
determines whether Eval(g) is in FP or is ^P-hard. See Sections [7] and El 

Our result is in a similar spirit to the result of Khma, Larose and Tesson which gives a 
dichotomy for the problem of counting the number of solutions to a system of equations over a 
fixed semigroup. Although our application is rather different, parts of our proof draw inspiration 
from the proof of their theorem. 
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2 The main theorem 



For 1 < k < r, we will define 

fW(z h ...,z k ) = g(zi,...,z r ). 

Z k + 1 ,...,Z r <=D 

Note that is symmetric and that . . . , z r ) = g(z\, . . . ,z r ). Let 

RW ={( Zl ,...,z k ) :fW(z 1 ,...,z k ) >0} 

be the relation underlying jfW. We will view relations either as subsets of -D fc or as functions 
D k — > {0, 1} according to convenience. To avoid trivialities, we assume that is the complete 
relation, i.e., that all elements of D participate in the relation; if not, an equivalent problem can 
be formed by simply removing the non-participating elements from D. For any k < r we have 
f [k] (zi, . . . ,z k ) = E« fc+l6j D/ [fc+1] (^l>--->^+i) so if /c > 2 then (z 1 ,z 2 ) G R [2] is equivalent to 
"there exist zs,...,z k such that (zi, . . . , z k ) G Let = be the equivalence relation which 

is the transitive, reflexive closure of RPk The domain D is partitioned into equivalence classes 
("connected components") D = D\ U • • • U D m by =. 

We will use the following notation: We will let I range over [m] , and use it to refer to a particular 
connected component Dg. When applied to any function as a subscript, it denotes the restriction 
of that function to the relevant connected component. For example, fl : {Dg) k — > Q~ denotes 
the restriction of to the ith. connected component D^. Likewise, ge is the restriction of g to Dg. 
Given the definition of =, it is clear that /' fc l = © • • • © fm (meaning that f^(z\, . . . , z k ) = 
unless zi, . . . ,z k are all in the same connected component). We can now state the main theorem. 

Theorem 2. Let g : D r — > Q~° be a symmetric function with arityr > 3 and connected components 
D\, . . . ,D m as above. If g satisfies the following conditions, for all t £ [m], then Eval(g) is in FP. 
Otherwise, Eval(g) is complete for FP* P . Moreover, the dichotomy is effective. 

• There is a set Ai and a positive integer se, such that is the Cartesian product of Ag and 
[s{\ (which we write as Di = A^ x [s(\). 

• There are positive constants : i £ [se]} and a relation Si C Ag such that, for a\, . . . , a r G 
Ag and i±, . . . , i r G [sg], 

gi((ati,h), . . . , (a r , i r )) = ^g,h ■■■\g )ir Si(ai,...,a r ). 

• There is an Abelian group {Ag,+) and an equation a± + • • • + a r = a (for some element 
a G Ag) which defines Sg in the sense that (ai, . . . , ct r ) G Sg if and only if a\ + ■ ■ ■ + a r = a. 

The algorithm used in the polynomial time solvable cases of Theorem [2] still works if the instance 
is a CSP instance rather than a hypergraph. Thus, we have the following corollary. 

Corollary 3. Let g : D r — > Q~° be a symmetric function with arity r > 3 and connected components 
D\, . . . , D m as above. If g satisfies the conditions in Theorem^ for all I G [m], then f^CSP(g) is 
in FP. Otherwise, #CSP(g) is complete for FP* P . Moreover, the dichotomy is effective. 

Some of the #P-hardness proofs in the proof of Theorem [2] could be simplified if we allowed ourselves 
a general CSP instance rather than a hypergraph, but we refrain from using this simplification 
in order to obtain the strongest-possible result (that is, to obtain Theorem [2] rather than just 
Corollary [3]) . 
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3 A restatement of the main theorem 



We introduce some further notation and restate the main theorem more compactly. Along the way 
we gather more information, e.g., about the factorization Dp = An x [se]. 

We define the equivalence relation ~/% on D as follows: z\ ~fc z[ iff there is a A in Q >0 such that, 
for all Z2, ■ ■ ■ , Zk G D, /M (zi, Z2, ■ ■ ■ , z k ) = A/M (z[, Z2, ■ ■ ■ , z k ). Note that ~^ refines Also, 
~2 refines = since, for any z\,z[ G D, z\ ^2 z '\ implies that there exists Z2 satisfying R\ 2 \z\, Z2) 
and R®( z[, Z2), which in turn implies z\ = z[. 

Let [x][ fe l = {y : y r^ k x} be the equivalence class of x under Choose a unique representative 
g[k] £ Thus x^ = y[ fc l if and only if x ~ fc y. Let A^ = {iW : x G D}. Let A^ denote the 

restriction of A^ to D e so A^ ={#1 :sgZ)^}. 

Note that i?M is consistent with in the sense that R^{z\, . . . , z k ) = R}- k \zf\ . . . , z k ), so we can 
quotient B)® by to get a relation = R^/^ k on A^. Note that 5^ is just the restriction 
of i?M to AM. Also, sf 1 is the restriction of iif 1 to A? 1 . 

Suppose is in the range 2 < k < r. We say that g is k- factoring if the following conditions hold 
for every I G [m]. 

1. There is a positive integer sf^ such that ZX« is the Cartesian product of A^ and [s^] (which 
we write as D e = Af ] x [sf ] ]). 

2. There are positive constants {xfj : i G [sf^]} such that, for ai, . . . , at G Aj, fc ' and i\, . . . ,i k G 



/] fc] ((ai,ii),.. . , (ajfc,ifc)) = A^ • • • A^ fe Sf ] (ai, . . . ,a fc ). 



If g is fc-factoring then we say that g is k-equational if, for every ^ G [to], there is an Abelian group 
(A^, +) and an equation a± + • ■ • + a k = a (for some element a G A^) which defines 5] in the 
sense that (a±, . . . , a k ) G Sl^ if and only if a\ + ■ ■ ■ + a k = a. 
Our main theorem (Theorem [5]) can be restated as follows: 

Theorem 4. Let g : D r Q~° be a symmetric function with arity r > 3. If g is r-factoring 
and r-equational then Eva 1(g) is in FP. Otherwise, Eva 1(g) is complete for FP* P . Moreover, the 
dichotomy is effective. 

Before proving Theorem U we prove that it is equivalent to Theorem El First, it is easy to see that 
if g satisfies the conditions of Theorem H] (that is, it is r-factoring and r-equational) then it also 
satisfies the conditions of Theorem [2] (taking Ag to be A^\ S£ to be sf \ and A^j to be A[ r ]). The 
other direction is a little less obvious. Suppose that g satisfies the conditions of Theorem [2j Fix any 
I G [to]. From the first condition of Theorem [2j we have Di = Ag_ x [sg\. Consider any a, a' G An 
and any i, i! G \sg\. We will argue that (a, i) ~ r {a! , i') if and only if a = a' . First, suppose a = a'. 
Then, for any «2, . . . , a r G Ai and 12, ■ ■ ■ , i r G \si\, the second condition of Theorem [5] gives 

g e ((a,i), (02,22), ■ • • , («r, ir)) = M,ih,i 2 ■ ■ ■ ^e,i r Se(a,a 2 , ... ,a r ) 

and 

ge((a',i'), (025*2)) ■ •, (a r ,i r )) = \i'^e,i 2 ' ' ' ^e,i r Se(a, a 2 , . . . ,a r ), 
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so, by the definition of ~ r , (a, i) ~ r (a', i'). Next, suppose (a, i) ~ r (a', £'). Then there is a positive 
constant A such that, for any 02, • • • , cx r £ At and £2, . . . , v £ [ S ^L 



M,iM,i 2 ' ' ' M,i r Se{a, U2, ■ ■ ■ , a r ) = AA^j/A^ ) i 2 • • • X^S^a , 02, ... , a r ). 

We conclude that, for any 02, ■ ■ ■ ,a r G Ag, St(a, ot2, ■ ■ ■ , a r ) = St(a', 02, ■ ■ ■ , a r ). By the third 
condition in Theorem [21 we conclude that a = a'. We have now shown that (ct,i) ~ r (a',i') if and 
only if a = a . This implies that we can take the set A e of unique representatives to be Ai and we 
can take s\ to be S£. Then, taking A^ to be A^j, g is r-factoring and r-equational (so it satisfies 
the conditions of Theorem Hj). So we conclude that the two theorems are equivalent. 

Now that we have shown that Theorem 0] is equivalent to Theorem [21 the rest of the paper will focus 
on proving Theorem^ The case r = 2 is that of weighted graph homomorphism, which was analysed 
by Bulatov and Grohe [4j. Theorem U] is true also when r = 2. In this situation, it could be viewed 
as a restatement of their result. Note, however, that "2-equational" is a restricted notion that 
places severe constraints on the groups (Af,+) that can arise. Indeed the only possibilities that 
are consistent with the connectivity relation = are the 2-element group C2 ( "bipartite component" ) 
and the trivial group ("non-bipartite component"). 

It will follow from the proof of Theorem [JJ (assuming that #P % FP) that a symmetric function g 
of arity r > 3 that is r-factoring and r-equational is ^-factoring and fc-equational for all 2 < k < r. 
In fact, the Abelian groups (A^\ +) will all be trivial for k < r: non-trivial group structure is only 
possible at the top level. As a first step in the proof of Theorem 0] we verify that non-trivial group 
structure is only possible at the top level. 

Lemma 5. Let g : D r — > Q~° be a symmetric function with arity r > 3. If g is k-factoring and 
k-equational for some k < r then for every £ € [m] there are positive constants {xf'j : i £ Df\ such 
that, forii,...,ik £ D e , 

pMa,- ;.\ - \[*] 



jf(*i,..^) = A^...A£ 



Proof, g is /c- factoring so Dg = A^ x [sp] and, for a%, . . . , a k £ and i%, . . . ,i k £ [s]j ], 



ff\(ai,ii), . . . , (a k ,i k )) = ■ ■ ■ xfj k sf \at, . . . , a k ). 

Ik] [fel-i 

Now consider cx\, . . . , a k+ \ £ A e and i\, . . . , i k+ i £ [s e ]. If 



ff +1 \(ai,ix), (a k+1 ,i k+1 )) > 

then 

ff\(ai,ii), (a k -i,ik-i), (a k ,i k )) > 

and 

f\ k \(ai,ii), . . . , (afc_i,£fc_i)> («fe+i, h+i)) > 0. 

So since g is /c-equational, 

a± + h afc_i + a k = a\ + h a k _\ + a k+ i = a 

so a k = a k+1 . By symmetry, u x = • ■ ■ = 
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Now if ((a, i), (J3, j)) G then there exist a 2 , ■ ■ ■ ,oc k an d i-2, ■ ■ ■ , i k such that 

f^((a,i),(f3,j),(a 2 ,i 2 ),...,(a k ,i k )) > 0. 

Thus, a = /3. Taking the transitive closure, we note that if (a,i) and (/3,j) are both in Di then 
a = /3. Hence \Af ] \ = 1 so D t = [sjf ] ]. □ 

Our strategy for proving Theorem H] is now as follows. Suppose Eval(g) is not #P-hard. We 
prove, for k = 2,3, ... ,r in turn, that g is ^-factoring and A;-equational. For k = 2 this follows 
straightforwardly from Theorem [TJ The inductive step from k to k + 1 is where the work lies, but 
Lemma [5] plays a role. Ultimately, we deduce that g is r-factoring and r-equational. Conversely, if 
g is r-factoring and r-equational, the partition function Z 9 may be computed in polynomial time 
using existing algorithms for counting solutions to systems over Abelian groups, and hence Eval(g) 
is polynomial time solvable. 

4 Preliminaries 

An easy observation that will be frequently used in the rest of this paper is the following. 
Lemma 6. //Eval(/[ fc l) is #P-hard, for some 2 < k < r, then so is Eva I (5). 

Proof. An instance of Eval(/^) is a fc-uniform hypergraph. Simply pad each edge e = (u\, . . . , Uk) 
to size r by adding r — k fresh vertices as follows: [u\, . . . , u^, z k+i> • ■ ■ > ^v) - ^ ^ sasy to verify that 
this is a polynomial time reduction from Eval(/^) to Eval(g). □ 

Another easy observation is that the partition function Z 9 (G) factorises if G is not connected. So 
we may assume henceforth that the instance hypergraph G is connected. 

For z G D let A'jf' be defined so that, for all z 2 , ■ ■ ■ , Z}~ G D, 

f^(z,z 2 ,...,z k )=\ff^ k \z 2 ,...,z k ). 

(Recall from the definition of ~fc that A'^ does not depend on z 2 ,...,z k .) Then, by symmetry, we 
have 

/W(^.--^) = A / £ I ---At I / W (4* 1 ,---.# 1 )- (3) 

Define 

/W ( Z1 ,Z[) = Yl f [k] (*1> • • • , Z k )f lk] (Z[,Z 2 , ...,Z k ). 

Let be the (symmetric) binary relation underlying f^. It will turn out that Rffl and ~jt 
coincide when g is not #P-hard. 

For the purposes of this paper, a symmetric relation R C A k is said to be a Latin hypercube if, for 
all a\, . . . , a k -\ € A, there exists a unique € A such that (ai, . . . , a k ) G i?. Note that symmetry 
implies similar statements with the a^s permuted. This definition specialises to the familiar notion 
of Latin square if we take k = 3 and think of ai, a 2 and az as ranging over rows, columns and 
symbols, respectively. For k > 3 it is consistent with the existing, if less familiar, notion of Latin 
(k — l)-hypercube. 

We use the following interpolation result, which is (TUl Lemma 3.2] 
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Lemma 7. Let rji,...,r] m be known distinct nonzero constants Suppose that we know values 
Z±, . . . , Z m such that Z p = YllLi ItfTt f or 1 — P — m - The coefficients 71, . . . , 7 m can be eval- 
uated in polynomial time. 

Lemma [7] has the following consequence, since if we have rji = rjj below we can combine 7^ and jj 
into ji + 7j. 

Corollary 8. Let rji, ... , rj m be known nonzero constants Suppose that we know values Z\, . . . , Z m 
such that Z p = ^W? f or ^ — P — m - The value Zq = Y^eLi ^ can ^ e computed in polynomial 

time. 

As mentioned earlier, the base case (k = 2) in the proof of Theorem |4] will follow from the result 
of Bulatov and Grohe [4J. They examined the complexity of #CSP(g) and there is no immediate 
polynomial time reduction from #CSP(g) to Eval(g). The next lemma provides such a reduction 
for the case that we require. 

Lemma 9. Suppose h : D 2 — > Q~° has connected components D±, . . . , Dg, and underlying rela- 
tion Rh- Suppose also that Rh has no bipartite components. If the restriction hi of h to any 
component Di is not rank 1, then Eval(/i) is #P-hard. 

Proof. Let / be an instance of #CSP(/i). View / as a multigraph with possible loops and parallel 
edges. Form the graph G as the "2-stretch" of /; that is to say, subdivide each edge of I by 
introducing a new vertex. Note that G is a simple graph without loops. Define the symmetric 
function : D 2 -> Q"° by h^(x,y) = ^ zeD h(x, z)h(y, z). Note that Z h {G) = Z h(2 \l), and 
hence #CSP(/j( 2 )) reduces to Eval(/i). 

Suppose Eval(/i) is not #P-hard. Then #CSP(h^) is not #P-hard. By [U Thm 1(1)], h^ 2 \ viewed 

(2) (2) 
as a matrix, is a direct sum of rank-1 matrices; i.e., each h\ has rank 1. But each h\ is the "Gram 

matrix" of hi (the product of hi, viewed as a matrix, and its transpose), and it is a elementary fact 

that the rank of a matrix and its corresponding Gram matrix are equal [13]. Thus, for all £, the 

restrictions hi of h to Di are rank 1. □ 



5 Factoring 

Lemma 10. Let g : D r — > Q~° be a symmetric function with arity r > 3. Either Eval(/I 2 ') is 
j^-P-hard (which implies that Eval(g) is #P-hard) or g is 2-factoring and 2-equational. 

Proof. First, note that i?I 2 l has no bipartite components: If (£1,2:2) £ then there is a Z3 such 
that (zi, Z2-, Z3) € Bffl. By the symmetry of f^, we find that (z\,z^) and (22,23) are also in R^ 2 \ 
so the component containing z\ and Z2 is not bipartite. 

rol 

Now, by [4] (using Lemma|9|, f\ has rank 1. Thus, there are positive constants {fi z : z € D} such 
that, for every i £ [m] and every z\, Z2 in Dp, the following holds. 

ff ] {z u z 2 ) = n Zl n Z2 . (4) 

12] [2| 

We conclude that all elements in Di are related by ~2, so \A\ \ = 1. Thus, we can take sy = \Di\ 

[21 [21 

and X e z = u z and the trivial equation (since \A £ \ = 1). 
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The parenthetical claim in the statement of this lemma and subsequent ones comes from Lemma H3 

□ 

Lemma 11. Let g : D r — > Q~° be a symmetric function with arity r > 3. Let k be an integer in 
{3, . . . , r}. Suppose that g is (k — 1) -factoring and (k — l)-equational. Either Eval(/t fc l) is ff-V-hard 
(which implies that Eval(<?) is jf-V-hard), or all the following hold: (i) there are positive constants 
{Xz '■ z G D} such that (z\, . . . , z k ) = a£{ ■ ■ ■ A** {z\, . . . , Zk), (ii) for every connected 
component I G [to], the relation is a Latin hypercube, and (Hi) for every £ G [m], the sum 
J2 Z £[a]W A « ^ independent of a G Ag. 

Proof. Assume Eval(/M) is not #P-hard. Fix i G [m] and z±,z[ G Dg. By the Cauchy-Schwarz 
inequality, 

^2 fT\ Z li Z 2i---,Z k )ff\z' l ,Z 2 ,...,Z k )} < 

z 2 ,...,z k eD e 

fl k] ( Zl ,z 2 ,...,z k f £ fP(z[,z 2 ,....^ 



,Zk 

Z2,...,z k £D( Z2,...,z k GDi 



i.e., 



ff ] (z 1 ,z[f<ff ] (z 1 ,z 1 )ff ] (z' l ,z[), (5) 

with equality precisely when z\ ~& z[. Note that the difference between the right-hand-side and 
the left-hand-side in Equation (|5j) can be seen as a 2 by 2 determinant. 

Now Eval(/t fc l) < Eval(/[ fc l) since f^(u,v) can be simulated by a pair of constraints 

f [k] {u, w 2 ,..., w k )f [k] {v,w 2 ,..., w k ) 

using new variables w 2 , . . . ,w k , so Eval(/^) is not #P-hard. has no bipartite components 

since it is reflexive, so by [I] and Lemma El decomposes into a sum of rank-1 blocks. 

When z\ ^ k z[ we have strict inequality in (|5]), which implies 

ff ( Zl ,z[) = Yl / W (^2, • • • , Zk)f [k] (z[,z 2 , ...,z k )=0, (6) 
z 2 ,...,z k eD 

since otherwise f^ would not decompose into rank 1 blocks. 

So for each choice of canonical representatives a 2 , . . . , a k in Ay there is at most one representative 

a\ G A^ such that f^(ai,...,a k ) > 0. There is at least one such representative a± since, by 
Lemma [5l 

f [ft-i] , v _ .[fc-l] 

} g (a 2 ,...,a k ) - \ ia2 ■■■\ ak , 
and the X^ a values are positive. This is part (ii) of the lemma. 

Recall the definition of A'Jf' from Equation (|3|). For a G A^\ let X a denote the sum X a = 
T,ze{a]W ^'P- Similarly, let fl a = Z) z6 [ a ]W ^ej k ~^- Fix z 2 , . . . ,z k G D t . By LemmaEl 

[ fc _i] _ [ fc _i] , . ^ f m (y r ,\ 

A e, Z2 '" x tz k ~ h {z 2 ,...,z k )- 2^ h \ z h---,Zk) 

zi£D e 

- 2^ A z i ' ' ' A zfc h \ z i ' • • • ' z k ) 

z 1 £D e 

_T y[k] s,[k] .[k], Jk] [fcK 

— A Ql A 22 • • • A Zfc J f (Ctl,Z 2 , . . . , Z k ), 
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where cti is the unique representative in A^ such that f^(a±, z£\ . . . , z^) > 0. So for fixed 
ci!2, . . . , a k G Af\ there is a representative ot\ G such that 

Ev^ .[fc-l] .[fc-i] 
" ' ^,22 ' ' ' \z k 

= E E A Ql At ] ---At ] /F(«i,---,^) 

= A ai •••A afc /] fe ' (ai,..., 

Since we have a Latin hypercube (Part (ii) of the lemma) , any of ai , . . . , a k is determined by 
the other k — 1 of them. Thus, we can derive a similar equality omitting any other p, ai on the 
left-hand-side. Now the right-hand-side of the above equality is symmetric in the ay's, and the left- 
hand-side has exactly one aj missing, so by symmetry we conclude p, ai = • • • = p, ak and, further, 

\k] ~ ~ \k] / 

fi aj is constant for aj £ Ay . Moreover, X ai ■ ■ ■ X ak f £ (a±, . . . , a k ) is constant on representatives 
a±, . . . ,a k G A^ with fjfi (ai, . . . , a k ) > 0. That is, for any set of k representatives a[, a' 2 , . . ■ ,a' k G 
A^ with ff^ (a[, . . . , a' k ) > 0, the value of that expression A a / • • • A a / fc /j fe ' (a[, . . . , a' k ) is the same. 

Now define X$ = qA'^/A^M j where q is a constant, depending only on £, to be determined 
below. Then, whenever f\ k \z±, . . .,z k ) > 0, 

M (y \'{k] Ak] ( -[k] [fc]x 

l^i, • • ■ , 2fcJ — A Zl ■ ■ • A Zfc [Z 1 ,--.,Z k ) 

- c <? A zi A W A [zi]M A [z fc ]W ^ i z l '•••' z feJ- 

But 

c t A [zi]W A [z k \wh \ z i i---i z k ) 

is independent of zi,...,Zk (assuming, as we are, that ff\z\, . . . , z k ) > 0), so, by appropriate 
choice of C£, 

fl k] (z 1 ,...,z k ) = \ Zl W---\ z WRf ] (z 1 ,...,z k ). 

The choice of component D(_ was arbitrary, so a similar statement holds for over its whole 
range, as required by part (i) of the lemma. 

Finally, 

J2 \? ] = Ci £ Xf/X a = ce, 

ze[a]W z6[a]W 

establishing part (iii). □ 



Lemma 12. Lei g : D r — > Q >0 be a symmetric function with arity r > 3. Let k be an integer in 
{3, . . . , r}. Suppose that g is (k — l)-factoring and (k — l)-equational. Suppose there are positive 



constants {X [k] : z G D} such that f [k] {z l7 . . . , z k ) = \f} ■ ■ ■ xf k ] RW(zi, ...,z k )- Either Eval(/[ fe l) 



?.s 



#P-hard (which implies that Eval(g) is f^V-hard), or, for every i G [m], the multiset {aP : z G 

h 



[a] } is independent of the choice of a G A^ . 



Proof. In preparation for the proof, consider the unary constraint U (x) applied to a variable x and 
defined as follows: Take k — 1 new variables X2, ■ ■ ■ , x k then add the constraint f^(x,X2, ■ ■ ■ ,x k ). 
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The resulting unary relation U (x) will be used in the reduction that follows. For any I £ [m] and 
a G let ri£ = \A^ \ and C£ = X^efalM ^ (which, by Lemma HU is independent of the choice 
of a € A^). For any z\ G D£, 

U(zi)= E /fW--^) = E AW...AW J Rfl( Z i,... > ^) 

Z2,...,z k £De Z2,...,z k £D£ 

E E ^-Ag 

c, 2 ,..., ate 4 fc, :( 2 f, a2 ,... lQt )e4 fel «ae[« a ]W,... A e[a fc ]W 

= a?i E ( E >$)■■■( E »e 

a 2 ,...,a fc G4 fcl :(4 fe, ,a 2 ,..,a fc )G4 fcl *26[a 2 ]M 7 V **e[a fc ]W 

where the final equality uses part (ii) of Lemma [TT1 

[fcl 

The idea of the proof is to use U to "power up" vertex weights X z . In this way we discover that 
not only is J2 z e[a]W Ai independent of a E but so also is J2 z e[a]W (Az ) 3 for any positive 

integer j. This implies that the multiset of weights on an equivalence class [a]^ is independent of 
a E A? 1 . 

For z\ , . . . , Zk E T>t and j > 1 , define 
and 

...,Zk)=1pzi- ■ -1p Zk Rf\zi, ■ ■ -,Zk). 

Let fcbl = hf®- ■ -®h^. We will give a reduction from Eval(^l) to Eval(/M). Suppose G = (V, E) 
is a ^-uniform hypergraph (an input to Eva\(h^)). For j > 1, the hypergraph G^ is obtained from 
G as follows: for each vertex v in G of degree d v , add (A; — l)(j — new vertices and (j — l)d v 
new edges, each one incident at v and at k — 1 of the new vertices. Then 

Z h ^{G)= E II hf{a{ Ul ),...,a{u k )) 

a:V^D e ( Ul ,...,u k )GE 
a:V^D e ( Ul ,...,u k )GE 

= e iK*)^ 1 ) '" 1 " n t-.A^flf^),..,^)) 

= e n^iv^ 1 ) '^ n /i fci K«i),--.,^)) 

o-:y->D 4 ueV (Mi,...,u fe )GE 

= Z'JV W )- 
Thus (for connected G) 

Z ftlJ, (G) = E Z ^\ G ) = E ^ W (G y] ) = zf W (G®), 
ee[m] te[m] 
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so Eval(/»b'l) < Eval(/W). 

Assume Eval(/^) is not #P-hard. Then Eval(/i^) is not #P-hard for any j > 1. Recall from the 
statement of the lemma that g is (k — l)-factoring and {k — l)-equational. Then from Lemma ITT1 
part (hi), 

= (n}-^- 1 £ (A^ 

ze[a]W zG[a]W 

is independent of a £ ^l[ fc ' for all j > 1. This can only occur if the multiset {A^ : z G is 
independent of a G A e . □ 

We will use the following corollary of Lemmas \10\ [TT1 and [12j 

Corollary 13. Let g : D r — > Q~° be a symmetric function with arity r > 3. Let k be an integer in 
{3, . . . , r}. Suppose that g is (k — l)-factoring and (k — l)-equational. Either Eval(/^) is jf-V-hard 
(which implies that Eval(g) is jfV-hard), or g is k-factoring. 

Proof. By Lemma [TT] part (i) there are positive constants {A^ : z G D} such that 

fW(z 1 ,...,z k ) = \W--.\WRW(z 1 ,...,z k ). 

Fix any £ G [m]. By Lemma fl"2| the multiset {Xz : z G is independent of the choice of 

a G Ay Let si be the size of this multiset. Then Dp = A^ x [s^] giving condition (1) in the 
definition of A;-factoring. Also, if the element z G Di corresponds to the z'th element of the ~& 
class [z][ fc l then the value Xz just depends upon i (and on I) — it is independent of the equivalence 
class We denote this value as xfj. Thus, for ai, . . . , a k G and i±, . . . ,i k G [sf^], 

/f'((ai,n), • • • , (a k ,i k )) = xf^ ■ ■ ■ \fJ k Rf ] (a!, . . .,a k ), 
giving condition (2) in the definition of &:-factoring. □ 

Lemma 14. Let g : D r — > Q~° be a symmetric function with arity r > 3. Let k be an integer in 
{3, . . . , r}. Suppose that g is k-factoring. Then, for every I G [m], 

Z f [ *\G) = kf(G) Z s t\G), 

where 

4 fel (G)= n E( A S)^- (7) 

vev(G) ie[s W] 

Proo/. For G = (V,E), 

Z f?\G)= II fl k] (Hui),r(u l )),...,(a(u k ),r(u k ))) 



E II ^■■■^sfk^ui),...,^)) 

e ( n sf\a( U1 ),...Mu k )))( e nc^w)* 

Z s[ "\g) Af(G). 
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□ 

Lemma 15. Let g : D r — > Q~° be a symmetric function with arity r > 3. Let k be an integer in 
{3, . . . , r}. Suppose that g is (k — l)-factoring and (k — l)-equational. Either Eval(/^) is jf-V-hard 
(which implies that Eval(g) is #P-hard), or Eva\(S^) < Eval(/' fc l). 

Proof. Suppose that G is a connected /c-uniform hypergraph. For any positive integer, p, let 
G 1 ,...,G P be copies of G. Let {v{, . . . ,v J n } be the vertices of G J . Construct G^ by taking 
the union of G 1 ,...,G P along with n(k — l)p new vertices and 2np new edges: For each i € 
[n], t £ [k — 1] and j G \p\ we add a vertex Then we add edges (uf l5 . . . , v? ik _ x , vj) and 

Now by Corollary I13|, g is /c-factoring, so Di = A$ x [s^]. By Lemma [U 

z /W (Gf M) = Af(GW) Z 5 ^(G W )- (8) 

£e[m] 

We now look at the constituent parts of the right-hand-side of Equation ([8]). First, 

W(GM)= E II 5f ] (aK),...,aK)). 

(j: V(GW)^Af 1 h,..«)£B(GW) 

By Part (ii) of Lemma [TTJ S 1 ]^ is a Latin hypercube. So, given the values o-(v{), . . . ,a{v 3 n ), the 
values <r(n| 1 ), . . . ,o-{v? ik _ 2 ) (for i G [n]) can be chosen arbitrarily from A^. Then there is exactly 
one choice for each o-(u J ik _ l ) so that 

(a(u{ 1 ),...,a(ui k _ 1 ),a(vi))ESP. 

Then for j < n to have 

W<i),.-^->(«f modn)+1 ))e# 

we must have cr(vj +1 ) = cr{xPA. (If j = n then 

(,K 1 ),...,a« w ) 1 ^™ d * 1 )) e f 
just ensures v\ = vf so it adds no new constraint.) Thus, 

^(Gi)-^* 1 (wi,-».«»Jk)eB(Ga) 

= | A W|»(*-a)p z 5W (G)i 
Also, using dr(w) to denote the degree of vertex w in hypergraph T, 

a}\gW)= n e ( A ?i) dcb]H 

w£V(GlPl) h6[s W] 

P / \ P 

k [*h 2 



n e war 1 ^ 2 n n e 



Wi 



ye[n] te[fc-i] he [ s 



Ml 
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where the first factor on the right-hand-side is the product over vertices v\ and the second factor 
is the product over vertices 

So Zf [k] (G®) is equal to 

E(n e («) Jo< "' H2 )"( n n e (rfi^^M. 

«G[m] V iG[n] hg[s W] 7 V ie[n] *e[*-l] he[a W] 7 

We can now use Corollary [8] with Z p = Z/ M (G?W), 7^ = Z s t" ] (G) and 

*=(n E(4) w+2 )(n n z (4!i) 2 )i4T M 

Let us take stock. Suppose 5 is not #P-hard and that g is (k — l)-factoring and (k — l)-equational. 
We know by Corollary [13] that g is ^-factoring, and by Part (ii) of Lemma [11] that the various 
relations Si are Latin hypercubes. The final step, the subject of the following section, is to show 
that the latter have additional structure, namely that they are defined by equations over an Abelian 
groups. It will follow that g is fc-equational. 



6 Constraint satisfaction and Abelian group equations 

Let S be an arity-A; relation on a ground set A. Recall our earlier discussion, in Section [TJ on the 
relation between Eval(S') and ^CSP(S'). Every instance G of Eval(S') can be viewed as an instance 
of #CSP(5) by taking the vertices as variables and the edges as constraint scopes. However, we 
noted that the converse is not true, since an instance I of ^CSP(S') might not be a properly-formed 
instance of Eval(5). Nevertheless, by copying variables, we can view an instance / of #CSP(5) as 
being a fc-uniform hypergraph G, together with some binary equality constraints on variables. For 
variables U and W, the constraint = (U, W) is satisfied if and only if o~(U) = a(W). The following 
lemma shows that, in our setting, these equality constraints do not add any real power - they can 
be implemented by interpolation. 

Lemma 16. Let S = S± © • • • © S m be a symmetric k-ary relation on a ground set A, such that 
each Si is a Latin hypercube. Then #CSP(S) < Eva I (5). 

Proof. For t € [to], let A( be the ground set of Si. 

Let I be an instance of #CSP(S) comprising a connected hypergraph G with vertices {vi, . . . ,v n } 
and v equality constraints. Note that this is without loss of generality - an instance / may be 
represented as a hypergraph G together with equality constraints in which equality is only applied 
to variables in the same connected component of G. 

For a positive integer p, construct a hypergraph G^ by combining G with vp(k — 1) new vertices and 
Ivp new edges: For j 6 [p] and i £ [u] add vertices 1? . . . , 11? , _ 1 . If the i'th equality constraint is 
= (v s , v t ) then add the 2p edges (v a , u\ x , ... , i^-i) and (v t , u\ x , ... , it^-i) for j € [p\. 

Now, suppose we are given the values a(v%),. . . , a{v n ) in Ag. By the Latin hypercube property, we 
can have (a(v s ), o-{u\^), . . . , c(^,fc_i)) € S and (er(«t), er(w^ 1 ), . . . , cr(u^ fc _ 1 )) € S only if o(v s ) = 
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a(vt). In that case, there are \A(\ k 2 choices for o~(u\ x ), o~{u\ k _ 1 ). So 

Z s {G [p] ) = Z Se (I)\A e \( k - 2 >. 

ee[-m] 

We can now use Corollary El □ 

The following lemma establishes the algebraic structure of the Si, using a result of Bulatov and 
Dalmau [3]. The proof itself has similarities to that of Palfy's theorem |14j (see, for example, [7]). 

Lemma 17. Suppose k > 3. Let S = S\ © • • • © S m be a symmetric k-ary relation on a ground 
set A such that, for each t G [m], Si is a Latin hypercube. Suppose Eval(S') is not jfP-hard. Then 
for each £ G [m], the relation Si is defined by an equation over an Abelian group Qt = (At,+) as 
follows: for some element a G An, (cti, . . . , «&) G Si if and only if a\ + ■ • • + = a. 

Proof. Suppose Eval(S') is not #P-hard. Fix I G [m], and fix any element an G A$ and denote it by 
0. If (a, f3, 7, 0, . . . , 0) G Se we will write 7 = a ■ f3. Then we will call (a, /3, 7) a triple and denote 
the set of triples by T#. We will call (a, (3, 7, 0, . . . , 0) G Si the corresponding padded triple. For 
given a and (3, the existence and uniqueness of 7 in a padded triple follows directly from the fact 
that Si is a Latin hypercube. Thus we may regard a ■ (3 as a binary operation on Ap, and hence 
At = (At, ■) is an algebra. By symmetry, the binary operation of At is commutative, and satisfies 
the identity a ■ {a ■ j3) = f3 for all a, f3 G A^. However, the operation is not necessarily associative. 

By Lemmad! #CSP(5') < Eval(S), so #CSP(5) is not #P-hard. Thus, by [3j, there is a Mal'tsev 
polymorphism if (a, /3,7) on A which preserves S. Recall that a Mal'tsev operation tp : A s — > A is 
any function which satisfies the identities (p{a, (3, (3) = (p((3, (3, a) = a for all a, (3 G A. We may use 
(p to calculate, as follows. Each line of a table is a triple in Tj>, and the Mal'tsev polymorphism 
implies that the bottom line is also a triple in Ti, using the fact that 99(0, 0, 0) = in the padded 
triples (which follows from the Mal'tsev property). Thus 



a 


7 


a ■ 


'7 




7 


P- 


'7 


7 





P- 


' 7 


A7) 





a 


•7 



and hence (p(a, 0, 7) = /3 ■ (a ■ 7) is a term of the algebra .4^. We have 

7) = /3 • (a • 7) = /? • (7 • a) = y?(7,/3,a), 

so (/? is a symmetric Mal'tsev operation (in the sense that it is symmetric in the first and third 
arguments). 

Define a new binary operation + on At by a + /? = <^(a, 0, /3) = • (a ■ ft). It follows immediately 
that + is commutative. Hence 

+ a = q + = 0-(a-0) = <p(a, 0, 0) = a, 

so is an identity for +. Denote 0-0 by 2 , and define —a by a ■ 2 . Then 

(-a) + a = a + (-a) = 0-(a-(a-0 2 )) = • (0 2 ) = • (0 • 0) = 0, 

so —a is an inverse for a. As usual, we write a — f3 for a + (—/?). 
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We have 

a 2 a ■ 2 

2 

P /3J) 

a + P /To a-0 2 

so a + /3 = (/3 • 0) • (a • 2 ) and since + is commutative, a + j3 = /3 + a = (a • 0) • (/3 ■ 2 ). Then 

a ■ /3 • 2 a + p 

2 

7-0 7 

</?(a-0,0 2 , 7 -0) /TO 2 (a + /3) + 7 

Therefore, since </? is symmetric in its first and third arguments, 

(a + /3) + 7 = ^(q-0,0 2 , 7 -0)-(/3-0 2 ) = y>( 7 • 0, O 2 , a • 0) • (/3 • 2 ) 

= (7 + /3)+« = a + (j + /3) = a + (£ + 7). 

The operation + is therefore associative, and hence the algebra C?£ = (Ag,+,— ,0) is an Abelian 
group. Hence, since —X is defined to be X-0 2 and a — 2 = — (— a + 2 ), we have, for any a,/3 € Ag, 

a - 2 2 -a + 2 

2 

o 2 p -p 

a i9 -a - i9 + 2 

where we used the fact that, by definition, tp(x, 0, y) = x + y. Thus a ■ /3 = —a — (3 + 2 ., and it 
follows that 

T £ = {(a,f3,-a-P + 2 )eAj:a,(3eA e } 

= { (a, /?, 7) £ A\ : a + /3 + 7 = 2 in ^} . (9) 

In particular, (a, —a, 2 ) G Tg for all a € Ag, and hence (0, 0, 2 ) € T f . It follows further that 

<p(a, 0, 7) = P ■ (a ■ 7) = -p - (a • 7) + 2 = -(3 - (-a - 7 + 2 ) + 2 = a - f3 + 7, 

so the Mal'tsev operation is the term a — P + 7 in the Abelian group Qg. 

Now assume by induction that the conclusion of the lemma is true for any S of arity less than k. It is 
true for arity 3 by ([9]), since then, for any I € [m], Sg = Tg. For larger k, suppose (ai, 02, . . • , ak) € 
Sg is arbitrary. Then, using the Mal'tsev operation and padding the triples (a±, — a±, 2 ), (0, 0, 2 ), 
we have 





a 2 




04 




a 1 


— OL\ 


2 














o 2 











a\ + a 2 


a 3 


«4 





Now the (k — l)-ary relation 

Sg = {("2, «3) • • • > «fc) G : (°' «2> «3j • • • ) °4) e M 

is symmetric and has the same Mal'tsev operation as S^. Thus we can define the same Abelian 
group Qg, and by induction we will have 

S> = {(a' 2 ,a' 3 ,...,a' fc ) G^T 1 : ^ k a> = a> in 
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for some a' G Ag. But we have shown that, for all (ai, ct2, 013, . . . , a^) G S^, we have (ai + 
CK2, 03 ... , afc) G S' e . Thus, since Gt is an Abelian group, 

Si = {((*!, (X2, a s ,..., ak) € A$ : Y^,j=i a j = a m &}j 
where a = a' , completing the induction and the proof. □ 

7 Proof of Theorem [4] 

Proof. Let g : D r — > Q~° be a symmetric function with arity r > 3. First, suppose that g is 
r-factoring and r-equational. Then applying Lemma Q31 with k = r, we find that, for connected G, 

Z°(G) = A f' ] ( G ) Z s ?\g). (10) 

ee[m] 

Now since 5 is r-equational, sj r ' is defined by an equation over an Abelian group (A^\+). Now, 

by |lll Lemma 13], Eval(S*| r ') is polynomial time solvable: The Abelian group is a direct product of 
cyclic groups of prime power. For each of these cyclic groups, we just need to count the solutions 
to a system of linear equations over the field Z p and this can be done in polynomial time (see |llj). 
Thus, Eval(Sy^) is in FP. To show that Eval(g) is in FP, it remains to show that A[ r '(G), as defined 
in ([7j), can be computed in FP. This is immediate over the number field Q(0, X^\, . . . , A^, ). In 
Section [8l we show that it can even be computed in FP over the number field Q(9). 

Suppose now that Eval(g) is not #P-hard. Then by Lemma [TUl g is both 2-factoring and 2- 
equational. Next suppose that, for some k G {3, . . . , r}, g is (k— l)-factoring and (k— l)-equational. 
Since Eval(g) is not #P-hard, we know that Eval(/t fc l) is not ^P-hard. By Corollary \T3[ g is k- 
factoring. Suppose, for contradiction, that g is not A:-equational. By Part (ii) of Lemma [TT| each 
is a Latin hypercube, so by Lemma [T71 Eval(5^) is #P-hard. By Lemma [T5l Eval(/t fc l) is #P-hard, 
giving the contradiction. So g is fc-equational. By induction, g is r-factoring and r-equational. 

It remains to consider the effectiveness of the dichotomy. For this, we must show that there is an 
algorithm that determines whether g is r-factoring and r-equational. This is nearly identical to a 
proof that the dichotomy in Theorem [2] is effective, however the notation is simpler in the latter 
context, so we provide this proof next. □ 

Lemma 18. The dichotomy in Theorem^ is effective. 

Proof. We must show that there is an algorithm that determines whether the conditions in The- 
orem [5] are satisfied. The connected components Di, . . . ,D m can easily be determined. Then, for 
each I G [m], there are a constant number of possibilities for the decompositions Di = Ai x \s(\ 
(I G [m]) which can all be checked, if necessary. Then, for the third condition, there are only a 
finite number of possibilities for the group structure, corresponding to the factorisations of \At\. 
Again, these can all be checked to see if any defines Sg, for each £ G [m]. 

For the second condition, for each £ G [m], we need to decide the satisfiability of a system of the 
form 

g((a 1> i 1 ),...,(a r ,i r )) = \i ) i 1 --\i > i r for all (ai, . . . , a r ) G St and h, . . . ,i r £ [s e ]. (11) 
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Thus we have 

M,i = g((ax,i), . . . , (a r ,i)) 1/r for all (a\, . . . , a r ) G Si and i G [se], (12) 
and hence (fTTj) is equivalent to the system 

r 

g((ai,ii),...,(a r ,i r )) r = JJ g((ai, ij), . . . , (ov, i,-)) 

i=i 

for all (ai, . . . , a r ) G and ii, . . . , i r G [sg], which can be decided in constant time by computation 
in the number field Q(8). □ 



8 Computation of Z 9 (G) in Q(0) 

Observe that jZJ), (fl~0|) and (fT2"j) seem together to imply that, in the polynomial time computable 
cases, we must compute Z 9 {G) in the number field Q(6, X± t i, . . . , \i )S1 , • • • , A m> i, . . . , A TO)Sm ), where, 
for £ € [m] and i € [s^], A^j = A[ r ] is an r th root of one of the original weights. This seems anomalous, 
since Z 9 (G) is actually an element of Q(0). We conclude by showing that the computation of Z 9 (G) 
can be done entirely within Q(#), as might be hoped. 

To do this, we must expand the expressions 

a ?(g)= n f>,^- 

veV(G) i=l 

To simplify the text, we drop the subscript t in the rest of this section, writing s for S£ and Aj for 
Xl t i and AM for A[ r '. Thus, we wish to expand 

a m (g) = n (xx> 

»eV(G) «=l 

The exponents of A« (i € [s]) in the monomials of the expansion of AM(G) are given by 

s 

$v,id v , where S v>i = l and <5 V)i € {0, 1} (i E [s],v eV(G)). (13) 

u£V(G) i=l 

Recall that M denotes the number of edges of G. Thus there are 0{M S ) possible monomials in the 
Ai, and the integer coefficient of each monomial Y\t=i \ * are gi ven by computing the number of 
solutions to systems of equations of the form 

s 

^2 5 v,i d v = M h where ^S Vti = 1 and S v>i G {0,1} (t G [s],v G V(G)). (14) 

v£V(G) i=l 

This can be done for all < M, < rM (i G [s]) in 0(nM s ) time by dynamic programming. An 
easy counting argument shows that Y2veV(G) = rM, so this returns a nonzero coefficient for the 
monomial Yli=i OIU y if Si=i M% = rM. Thus, in fact, there are at most 

( rA ?_ s r ') = a™- 1 ) 
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such monomials, which is clearly polynomial in the input size. 

Thus we can compute in FP a representation of (G) as a multivariate polynomial with monomials 
ULiK such that J2t=i M i = rM and Mi > (i £ [s]). We can express each such monomial 
in terms of the original weights, as follows. Let r%j {i € [s],j € [M]) be nonnegative integers such 
that Ylt=i r ij = r (i e [M]) and YljLi r ij = M (i € [s]). Such numbers always exist, though they 
will usually be far from unique, and can be computed in O(M) time. They are the entries of a 
contingency table with row totals Mj (i G [s]) and column totals r (j € [M]). See, for example, [8]. 
Now each column (j € [M]) can be interpreted as an r- multiset {iij, ■ ■ ■ , Vj} ^ [s], where i € [s] 
appears with multiplicity r^. Thus, choosing any (ai, . . . , a r ) E 5, we have 

s M s M M 

i=l j=l i=l j=l j=l 

using (jlip . This can be computed in O(M) time in Q(^), so Z 9 (G) can be evaluated in 0(M S ) time. 
The most demanding part of the computation seems to be the 0(nM s ) time needed to determine 
the relevant monomials by dynamic programming. But clearly all computations can be done in FP, 
and by working entirely within Q(#). 
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